
QUIESCENT RECONFIGURATION OF 
A ROUTING NETWORK 

Cross-Reference to Related Applications 



This application contains subject matter which is 
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Technical Field 



This invention relates in general to reconfiguration of 
a routing network and, in particular, to quiescently 
reconfiguring a routing network having one or more special 
5 nodes for facilitating reliable delivery or ordered delivery 
of messages within the routing network using logging or 
sequencing techniques employing the special node(s). 



Background of the Invention 



Many network environments enable messages to be 
10 forwarded from one site within the network to one or more 
ifl other sites using a multicast protocol- Typical multicast 

W protocols send messages from one site to one or more other 

vf* sites based on information stored within a message header. 

That is, each message has two components: the message 
U 15 header, which includes the routing information, including 
;^ destination addresses or a predefined group name that is 

y associated with a fixed list of destinations, and a data 

content, which is the data of the message. The routing 
\Q information is read from the message header and is used to 

*y 20 send the data content of the message to the specified 
destinations . 



One example of a system that conventionally includes 
such a network environment is a publish/subscribe system. 
In publish/subscribe systems, publishers post messages and 
25 subscribers independently specify categories of messages in 
which they are interested. The system takes the posted 
messages and includes in each message header the destination 
information of those subscribers indicating interest in the 
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particular message. The system then uses the destination 
information in the message to forward the message through 
the network to the appropriate subscribers. 

More particularly, a publish/subscribe system includes 
5 a network of message routers (or simply routers or nodes) 
connected via links in an arbitrary graph topology. A 
number of clients connect to the periphery of this router 
network and either publish or subscribe to messages. A 
message includes of a number of attributes, which are name- 
10 value pairs. A subscription specifies a predicate on the 
attributes of messages. 

The problem addressed by the present application is how 
to handle changes of topology in such a routing network. 
The need for changing topology arises from a number of 

15 conditions, including: (1) the need for system maintainers 
to move routers on- and off-line; (2) the growth of the 
network; and (3) changes in link speed that alter the 
optimum spanning trees of the network; etc. An assumption 
is made that a configuration manager has made a decision to 

20 effect a particular topology change (i.e., a 

reconfiguration) . The present invention is directed to a 
technique by which nodes of the router network execute a 
reconfiguration decision, eventually resulting in a new 
state of the network in which messages are forwarded using a 

25 new spanning tree. 

Disclosure of the Invention 



Briefly summarized, the present invention comprises in 
one aspect a method for reconfiguring a routing network. 
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The method includes: quiescing the routing network to 
preserve a first-in first-out ordering of data messages 
within the routing network; and reconfiguring the quiesced 
routing network. As an enhancement, the method further 
5 includes the quiescing of control messages within the 

routing network prior to the reconfiguring, and the routing 
network comprises a special network having at least one 
logging node or a sequencing node wherein messages are 
either logged to persistent storage for reliable routing or 
10 sequenced with other messages received by the network for 
ordered delivery. 



In another aspect, the invention comprises a system for 
reconfiguring a routing network. The system includes means 
1j for quiescing the routing network to preserve a first-in 

?J 15 first-out ordering of data messages within the routing 

network, and means for thereafter reconfiguring the quiesced 
routing network. 



iJJ 



In still another aspect, a system for reconfiguring a 
!*J publish/subscribe system is provided. This reconfiguration 

20 system includes a routing network adapted to receive 
■l3 published messages for forwarding to subscribers. The 

network is further adapted to quiesce data messages and 
control messages within the network in response to a 
reconfigure command to preserve a first-in first-out 
25 ordering of data messages and control messages within the 

routing network, and to then reconfigure the routing network 
once data messages and control messages have been quiesced. 



In a further aspect, the invention comprises an article 
of manufacture which includes at least one computer usable 
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medium having computer readable program code means embodied 
therein for effecting reconfiguring of a routing network. 
The computer readable program code means in the article of 
manufacture includes: computer readable program code means 
5 for causing a computer to effect quiescing the routing 

network to preserve a first-in first-out ordering of data 
messages within the routing network; and computer readable 
program code means for causing a computer to effect 
reconfiguring the quiesced routing network. 



10 To restate, provided herein is a technique for 

quiescent reconfiguration of a routing network of a 
publish/subscribe system without loss of messages. 
Reconfiguration is dynamic in that it is accomplished 
without shutting down the network, i.e., the execution of a 
'{% 15 reconfiguration is not directly visible to either publishers 
i c - or subscribers. The publishers and subscribers continue to 

i a& publish and receive messages as if no reconfiguration is 

taking place or has taken place. As used herein, quiescent 
I^j means that it is only necessary to temporarily suspend 

iij 20 message traffic within the routing network before 

'.BO? 

:Q reconfiguring the network and resuming message traffic. The 

% fi quiescent reconfiguration technique presented has particular 

application in large, continuously available broker networks 
in which change and evolution are inevitable and service 
25 disruption is not tolerable. Although principally described 
below in connection with a content-based routing network, 
the reconfiguration approach can be applied to other routing 
systems, e.g., subject-based routing systems. 



ffl 
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Brief Description of the Drawings 

The above-described objects, advantages and features of 
the present invention, as well as others, will be more 
readily understood from the following detailed description 
5 of certain preferred embodiments of the invention, when 

considered in conjunction with the accompanying drawings in 
which : 

Fig. 1 depicts one example of a distributed network 
incorporating and using the quiescent reconfiguration 
10 capability of the present invention; 

Fig. 2 depicts one example of a spanning tree used in 
accordance with the principles of the present invention; 

Fig. 3 depicts one embodiment of a distributed router 
network, configured with resilient or ordered routing 
15 capability, which is to undergo quiescent reconfiguration in 
accordance with the present invention; 

Fig. 4 depicts one embodiment of a router for use in 
accordance with the principles of the present invention in a 
router network such as depicted in Fig. 3; 

20 Fig. 5 depicts one example of an information table for 

routers of a network employing quiescent reconfiguration in 
accordance with the principles of the present invention; 

Fig. 6 is a flowchart of one embodiment of CONFIGURE 
message processing in accordance with the principles of the 
25 present invention; 
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Fig. 7 is a flowchart of one embodiment of QUIESCE 
message processing in accordance with the principles of the 
present invention; 

Fig. 8 is a flowchart of one embodiment of 
5 QUIESCE_ACKNOWLEDGE message processing in accordance with 
the principles of the present invention; 

Fig. 9 is a flowchart of one embodiment of 
QUIESCE_SPECIAL message processing in accordance with the 
principles of the present invention; 

10 Fig. 10 is a flowchart of one embodiment of 

=|1 QUIESCE_SPECIAL_ACKNOWLEDGE message processing in accordance 

j^j with the principles of the present invention; 

UJ 

V4 

Fig. 11 is a flowchart of one embodiment of 
I s - QUIESCE_COMPLETE message processing in accordance with the 

15 principles of the present invention; and 



hi 

.a* 



Fig. 12 is a flowchart of one embodiment for processing 
published messages in a routing network undergoing quiescent 
reconfiguration in accordance with the principles of the 
present invention . 

20 Best Mode for Carrying Out the Invention 

In one aspect, the present invention can be employed in 
a system using the routing capability disclosed in the 
above-incorporated application wherein messages are routed 
in a network based on the content of the data within the 
25 message. With data-content routing, a message does not need 
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to include destination information, such as destination 
addresses or a group destination name. Instead, data within 
the message is used to traverse a data structure to 
determine the link or links over which the message is to be 
forwarded in order to reach the consumers (subscribers or 
clients) interested in the message. 

More particular to the present invention, a quiescent 
reconfiguration technique is provided for, for example, such 
a content-based multicasting network. Further, the 
quiescent reconfiguration technique presented herein may be 
used in combination with, or separately from, the "reliable 
routing capability" and the "total ordering capability" 
described in the above-incorporated, co-filed patent 
applications. Whether used in combination with reliable 
routing or ordered delivery, or separately therefrom, 
quiescent reconfiguration in accordance with the principles 
of this invention may also be used with the above-described 
content-based routing network. However, those skilled in 
the art will recognize that the reconfiguration technique 
presented below is equally applicable to other routing 
systems, such as a subject-based routing system. One 
assumption underlying the reconfiguration technique is that 
links within the routing network propagate messages in a 
reliable first-in first-out manner. 

One example of a distributed network 100 incorporating 
and using the quiescent reconfiguration capability of the 
present invention is depicted in Fig. 1 and described in 
detail below. Network 100 includes, for instance, a 
plurality of computing units 102 coupled to one another by 
links 104. 



Each link couples two computing units in the network, 
and each computing unit may have any number of links 
connected to it. Each link is bidirectional, i.e., a 
computing unit may send and receive messages on the link. 
5 Each link is also connection oriented, i.e., the computer 
units at the ends of the link are notified when the link 
fails. Furthermore, each link propagates messages in a 
first-in first-out manner. Each computing unit in the 
network is either a client computer (represented by the 

10 smaller ovals, such as those having addresses 101a, 101b) , 
meaning that it either originates messages or has requested 
to receive messages whose content satisfies certain 
properties; or it is a router computer (represented by the 
larger ovals, such as 102a, 102b), meaning that it forwards 

15 messages received on one network link onto other links on 
the way to the client computer (s). -The clients are 
collectively referred to herein as clients 101 and the 
routers are collectively referred to as routers 102. (For 
purposes of this discussion, if a single computing unit 

20 serves both as a router and as a client, these two separate 
functions will be considered as two computing units 
connected by a link.) 

Each computing unit can be any type of computing unit 
that can be included in a network. For example, it can be 

25 an RS/6000 computing node or any other of various types of 
computing nodes, processors, computers or systems. The 
network can also include different types of computing units 
coupled to one another via the links. The links include, 
for instance, TCP connections over IP links, as only one 

30 example. 
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Distributed network 100 can be included in various 
systems that require the passing of messages or data. These 
systems include, for instance, the Internet or 
publish/subscribe systems (such as the above-summarized 
5 content-based messaging system) . 

Content-based publish/subscribe systems improve the 
degree of decoupling between publishers and subscribers. In 
content-based publish/subscribe systems, subscriptions are 
specified in terms of predicates on the posted data, rather 
10 than in terms of subject identifiers supplied by the 

publisher. One example of a content-based publish/subscribe 
system is described in co-pending U.S. Patent Application 
Serial No. 08/975,280, entitled "Method And System For 
j Matching Consumers To Events," Astley et al., which is 

" 15 hereby incorporated herein by reference in its entirety. 

* The examples described herein are with reference to a 

* content-based subscription system. However, these are only 
examples. The present invention can be employed with other 

1 types of systems without departing from the spirit of the 

yj 20 present invention. 



'S3 In one embodiment of the invention, each router 102 of 

network 100 (Fig. 1) has associated therewith a spanning 
tree, which lays out the best path (according to some 
criterion, such as latency) from the router to each client 
25 101. In this embodiment, it is assumed that routers agree 
on a common criterion for measuring distance between nodes 
in the network. There may in fact be multiple spanning 
trees. For example, alternative spanning trees may specify 
either backup routes, or peak load routes. Herein, it is 
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assumed that one spanning tree is initially in effect for 
the routing of any particular message. 

One example of a spanning tree, which is associated 
with a router is depicted in Fig. 2. As shown in Fig. 2, 
5 there is a path from a router 102a to every other node in 
the spanning tree. A message to be routed from router 102a 
to one or more of the other nodes is routed via one or more 
of the links associated with router 102a, i.e., links 1-3. 
For example, if a message is to be routed from node 102a to 
10 node 101c, then link 2 is used. As a further example, if a 
message is to be forwarded toward client 101a, then link 3 
is used. 

One embodiment for building a spanning tree from a 
network topology, that is an arbitrary graph, is described 
15 in detail in Introduction to Algorithms , by Cormen, 

Leiserson, Rivert, Chapter 24, pp 498-513, Published by MIT 
Press (1990), which is hereby incorporated herein by 
reference in its entirety. 

In addition to the spanning tree associated with each 
20 router, each router has a routing table. The routing table 
includes an entry for each client computer in the network. 
Each entry of the routing table associates a client address 
with the identifier of the network link constituting the 
next segment on the path in the spanning free from the 
25 router to the client. For a router with d network links, 
each such link identifier is an integer between 1 and d. 
For instance, the client having address 101a has a 
corresponding link identifier of 3 (see Fig. 2) . 
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Each routing table is constructed via information from 
the network topology (e.g., the client addresses) and hence 
from the corresponding spanning tree (e.g., the link 
identifiers), in a known manner. 

5 

By way of example, Fig. 3 depicts one embodiment of a 
publish/subscribe system, generally denoted 300, to employ 
quiescent reconfiguration in accordance with the principles 
of the present invention. System 300 includes a network of 
10 routers 302 connected via links 303 in an arbitrary graph 

topology. A number of clients connected to the periphery of 
this router network either publish messages or subscribe to 
messages. Those clients publishing messages comprise 

o 

j3 publishers 304, while those clients subscribing to messages 

HJ 15 comprise subscribers 306. The router network is responsible 
ly for routing messages from a publisher 304 to interested 

j** subscribers 306 based on matching messages to subscription 

\ sh predicates. 

y Again, this protocol (i.e., content-based routing) is 

^; 20 described in detail in the initially-incorporated co-pending 

Q 

kQ patent application entitled "Routing Messages Within A 

Network Using The Data Content Of The Message." As noted, 
from each router node at which a publisher is present, the 
system computes and stores a spanning tree to reach every 
25 other node in the network. All published messages from the 
publisher follow the paths in that spanning tree, with each 
router node performing enough matching to determine which of 
its child routers should receive the message. 



Within a publish/subscribe system, the present 
30 invention can be employed, in one example, with a network 
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achieving reliability of routed messages by saving messages 
to persistent storage within the network (and then 
retrieving and redelivering the message whenever there is a 
failure in the network) . Such a reliable routing network is 
5 described in the above-incorporated, co-filed patent 
application entitled "Message Logging For Reliable 
Multicasting Across A Routing Network." In accordance with 
that invention, publishers and subscribers that need 
reliability of messages may specify a quality of service 
10 parameter, e.g., "uniform delivery". Uniform delivery is 
provided for ensuring delivery of a message to all active 
subscribers notwithstanding failure in the network, e.g., 
the routers, or the links. One or more special routers in 
the router network 300 are designated logging node(s) or 
15 logger (s) 310 and support the ability to log messages to 
stable storage. When there is at least one subscriber 
!»* needing logging, the routing algorithm ensures that messages 

are routed to the logger. 



J7i Alternatively, or in further combination, the present 

UJ 20 invention can be employed in another example with a network 
% achieving ordering of routed messages by sequencing messages 

within the router network itself through assignment of a 
sequence number at a designated sequencing node of the 
network. Publishers and subscribers that need ordering of 
25 messages may specify a quality of service parameter called 
"totally ordered". Total ordering is provided for ensuring 
ordered delivery of a message to all active subscribers. 
Such message sequencing within the router network is 
described in the above-incorporated, co-filed application 
30 entitled "Message Sequencing For Ordered Multicasting Of A 
Message Across A Routing Network." When employing 
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sequencing, a special router 310 in the router network 300 
is designated a sequencing node or sequencer. Node 310 
supports the ability to sequence messages received into the 
routing network. When there is at least one subscriber 
5 needing total ordering, the routing algorithm guarantees 
that messages are routed to the sequencer. 

To summarize, quiescent reconfiguration pursuant to the 
present invention can be employed in a router network of a 
publish/subscribe system which utilizes at least one special 
10 node to facilitate either, or both, logging of messages or 
sequencing of messages with the network itself. 

Fig. 4 depicts one embodiment of a router node for use 
in accordance with the principles of the present invention. 
When a message arrives at router 400 it is stored into a 

15 message table 402 using a unique message identifier. 

Information stored includes a source node identification as 
well as the neighboring nodes to which the message is to be 
forwarded. These neighboring nodes are calculated by a 
content routing computation component 404 after the message 

20 has been received. Computation component 404 takes the 
message and, based upon stored subscriptions and the 
spanning tree, returns a set of destinations or links upon 
which the message should be forwarded. Again, in one 
embodiment, this computation is content dependent and can be 

25 accomplished as described in the above-incorporated co- 
pending application. However, this component can be 
implemented in other ways, e.g., subject-based routing. In 
one embodiment, to facilitate reliable routing of messages, 
router 400 would also include a reliable routing component 

30 406 and logging acknowledgment (LACK) received table 408 and 
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LACK send table 409, which are described in the above- 
incorporated, co-filed application entitled "Message Logging 
For Reliable Multicasting Across A Routing Network." 

If ordered routing of messages is employed, then router 
5 400 would also include recovery data, including a latest 

sequence number received and linked node tables (not shown) 
such as described in the above-incorporated, co-filed 
application entitled "Message Sequencing For Ordered 
Multicasting Of A Message Across A Routing Network." Again, 
10 the quiescent reconfiguration approach of this invention can 
be employed in a routing network of a publish/subscribe 
system employing data content messaging, reliable routing of 
messages, or sequencing of messages, either individually or 
in combination. 

15 Quiescent reconfiguration requires the entire routing 



network to be quiesced before the network is reconfigured 
and network operations are resumed. Since the network is 
quiesced before it is reconfigured, first-in first-out 
(FIFO) ordering is preserved between all nodes. In 
20 accordance with this invention, once quiesced, a node may 
^ not publish any new messages from one or more clients 

thereof. However, a quiesced node> must continue to route 
any data messages or control messages which it receives from 
other nodes of the network. 

25 In one embodiment, the following messages are used to 

quiesce a network in accordance with the present invention. 

• QUIESCE - sent by the root of a configuration 

change to initiate quiescence of data messages. 
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• CONFIGURE - sent by a configuration manager and 
nodes in the network to change a spanning tree 
used to route messages in the network. 
QUIESCE_ACK - sent by a node to acknowledge 
5 receipt of a QUIESCE message. Once a QUIESCE_ACK 

has been sent, a node cannot forward messages from 
clients into the routing network any new data 
messages for transmission across the network. 
However, it must continue to route all data and 
10 control messages sent by other nodes in the 

network . 

QUIESCE_SPECIAL - sent by a root to quiesce a 
special node, such as a logging node or a 

O 

,13 sequencing node. The special node responds with a 

15 QUIESCE SPECIAL ACK message. 

pi — — 

iy • QUIESCE_COMPLETE - sent by the root to indicate 
|^ that all nodes have been quiesced and the new 

iU configuration may be resumed. 

n 

y An overview of the algorithm for quiescing and 

jfif 20 reconfiguring a broker network is given next. First, a 

decision is made to reconfigure the broker network. This 
decision is outside of the scope of this description. This 
description assumes that a CONFIGURE message is communicated 
to a node in the network which becomes the root of the 
25 configuration change. For example, a Configuration Manager 
(described below) under control of some administrator might 
calculate a new configuration for the network and send a 
CONFIGURE message to the root. Parent and child nodes are 
defined in relationship to this root node. 



T3 1 
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In one embodiment, quiescent reconfiguration in 
accordance with this invention can then proceed in three 
phases. First, data messages within the network are 
quiesced. Second, control messages ( protocol messages used 
5 by special nodes, i.e., loggers and sequencers, e.g., 

LACK/SACK messages) are quiesced. Third, the network is 
reconfigured. These phases are summarized below. 

Data Message Quiescence 

The root node initiates quiescence by sending a QUIESCE 
message to all child nodes thereof using the existing 
configuration. When a node receives a QUIESCE message, the 
node begins queueing new data messages received from its 
clients, forwards the QUIESCE message to all child nodes, 
and sends a QUIESCE_ACK to its parent. QUIESCE_ACK messages 
may be coalesced at each level (i.e., a parent may wait to 
receive QUIESCE_ACKs from all of its children before sending 
its own QUIESCE_ACK to its parent), or all QUIESCE_ACK 
messages may be forwarded to the root node. Once quiesced, 
a node does not forward any new data messages from clients 
thereof. However, the node continues to forward data and 
control messages received from neighboring nodes (parents 
and children) . Also, a logging/sequencing node continues to 
produce LACK/SACK messages in response to data messages 
which it receives. 

25 Once the root node has received a QUIESCE_ACK message 

from all nodes, from the root's perspective, all data 
messages have been quiesced, and there can be no data 
messages flowing through the network towards the root. 
However, there may be data messages flowing away from the 



10 



|: '5? 



15 



20 
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root node. Also, there may be SACK and LACK messages 
flowing towards and away from the root node. 

Control Message Quiescence 

The root node next quiesces LACK/SACK messages in the 
5 network. Using the same quiesce technique outlined above, 

the root node sends QUIESCE_SPECIAL messages to any children 
sequencers and loggers. Note that if the root node is the 
only special node in the network, the control message 
quiescence phase is not necessary. When a logger /sequencer 
10 receives a QUIESCE_SPECIAL message, the node will not 

receive any new data messages on the current configuration. 
Iq When all LACK/SACK messages to any previous data messages 

N have been sent, the logging/sequencing node sends a 

i53 

ry QUIESCE_SPECIAL_ACK to the root node. After a 

H 15 logger/sequencer node has been quiesced, the node continues 
s k to forward any control messages received from neighboring 

nodes (parent and children) . 



n 



Once the root has received a QUIESCE_SPECIAL_ACK 
message from all logger/sequencer nodes, then from the 
20 root's perspective, all control messages have been quiesced 
and there are no control messages flowing through the 
network towards the root node. However, there may still be 
data and control messages flowing away from the root node. 

Reconf igura tion 

25 The root next informs all nodes that the network has 

been quiesced and proceeds to reconfigure the network. 

First, the root node sends a QUIESCE COMPLETE to all nodes 
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using the existing configuration. Because links between 
nodes are FIFO, the QUIESCE_COMPLETE message is guaranteed 
to be the last message in the existing configuration, i.e., 
no data or control message can follow the QUIESCE_COMPLETE 
message. Next, the root of the new configuration (if 
different from the old root, the old root would have to send 
a message to the new root), sends a CONFIGURE message to all 
nodes using the new configuration. When a node receives a 
CONFIGURE message, the node forwards the message to all of 
its downstream nodes in the new configuration. If a node 
receives a CONFIGURE message before it has received a 
QUIESCE_COMPLETE message on the previous configuration, the 
node must wait for the QUIESCE_COMPLETE before processing 
any new messages received on the new configuration. Waiting 
for the QUIESCE_COMPLETE message insures that all data and 
control messages have been received on the old 
configuration. Once a QUIESCE_COMPLETE message is received, 
any resources allocated for the previous configuration are 
no longer needed and may be freed. 

One detailed embodiment of quiescent reconfiguration in 
accordance with the present invention is described below 
with reference to Figs.. 5-12. Fig. 5 depicts one example of 
additional information employed at the routers of a network 
to use quiescent reconfiguration in accordance with this 
invention. For example, the table of Fig. 5 could be 
associated with or incorporated within the quiescent 
reconfiguration component 405 of the router of Fig. 4. This 
table includes (in one example) entries for five variables, 
labeled RECONFIG_ROOT , QUIESCING, CON FI GURE_RE CE I VE D , 
QUIESCE_MSG_QUEUE, and QUIESCING CHILDREN. Each of these 
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variables is described below in connection with the 
flowchart process of Figs. 6-12. 

Fig. 6 depicts one embodiment for processing a 
CONFIGURE message received, for example, from a 
5 configuration manager. As noted above, an assumption is 

made that the configuration manager independently makes the 
decision to effect a particular topology change, i.e., a 
reconfiguration. A configuration manager is a "network 
system service" that is responsible for maintaining 
10 information about the structure, function and status of a 
network system. This includes information such as the 
network topology, nodes with certain properties (such as 

O 

$0 loggers, sequencers), quality of service offered by nodes in 

%~- - 

the network, etc. A system's configuration may be 



15 statically fixed or dynamically changing. A configuration 
j B% manager may be centralized or distributed. Network system 

f<& management products such as IBM's Tivoli TME-10 contain 

L configuration management components. Again, from the 

w 

Ijj perspective of the current invention, a CONFIGURE message is 

:f:f 20 assumed to be received from the configuration manager. The 
*J3 present invention is directed to a technique by which 

'' u reconfiguration is transparent to the publishers and 

subscribers of the network, particularly where the network 
employs a special logging or sequencing node as described 
25 above . 

Referring to Fig. 6, processing commences with receipt 
of a CONFIGURE message 600 within the network. With receipt 
of this message, the node determines whether it is the root 
of the reconfiguration 605. If "no", then the CONFIGURE 
30 message is forwarded to all children nodes in the new 
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configuration 610, and the node determines whether the 
QUIESCING state is true 615. If "yes", then the 
CONFIGURE_RECEIVED indicator is set 620. Otherwise, the 
reconfiguration is complete and the node begins forwarding 
5 messages from its message queue (QUIESCE_MSG_QUEUE) using 
the new spanning tree 630. 

Returning to inquiry 605, if the node is the root of 
the reconfiguration, then the node's RECONFIGURE_ROOT 
indicator is set 640, as well as the QUIESCING indicator 
10 650. A QUIESCE message is sent to all children in the 
current configuration 660 and the QUIESCING_CHILDREN 
indicator is set to the number of children nodes in the 
current configuration 670. If the QUIESCING_CHILDREN count 
HJ is zero 680, then processing jumps to step 850 of Fig. 8 

nl 15 (described below) . 

n 

|.& Fig. 7 is a flowchart of one embodiment for QUIESCE 

^ message processing in accordance with the present invention. 

M 

y When a node receives a QUIESCE message 700, the node's 



QUIESCING indicator 710 is set, and the CONFIGURE RECEIVED 



\Q 20 indicator is reset 720. The QUIESCE message is forwarded to 
downstream routing nodes 730 and the QUIESCE_CHILDREN count 
is set to the number of downstream routing nodes 740. If 
the QUIESCE_CHILDREN count is zero, then the QUIESCE_ACK 
message is returned to the node's parent node 750. 

25 One embodiment for processing (pursuant to the present 

invention) a QUIESCE_ACK message is depicted in Fig. 8. 
Upon receipt of a QUIESCE_ACK message 800 at a node, the 
node's QUIESCING_CHILDREN count is decremented 810 and the 
node determines whether the QUIESCING_CHILDREN count is zero 
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820. If "no", then (in this embodiment) the node awaits 
further QUIESCE_ACK messages, meaning that the current 
processing is done 830. 

Once all children have responded with a QUIESCE_ACK 
5 message, the node determines whether it is the 

reconfiguration root 840. If "no", the node sends a 
QUIESCE_ACK to it parent node 845. If the node is the root 
node for the reconfiguration, then the node sends a 
QUIESCE_SPECIAL message to its children nodes having a 
10 special node in its sub-tree 850. Also, note from Fig. 8, 
that if the QUIESCING_CHILDREN count is zero in step 680 of 
Fig. 6, then processing jumps to instruction 850. After 

O 

j3 sending the QUIESCE_SPECIAL message, the node sets the 

"J QUIESCING_CHILDREN count to the number of children to which 

15 the QUIESCE_SPECIAL message was sent 860. If the 

QUIESCING_CHILDREN count equals zero 870, then processing 
U goes to step 1050 of Fig. 10. 

y Fig. 9 depicts a node's processing of a QUIESCE_SPECIAL 

p§ message in accordance- with the present invention. After a 

*J3 20 QUIESCE__SPECIAL message is received at a node 900, the node 
forwards the message to appropriate children nodes if a 
special node exists in a sub-tree from that node 910. The 
QUIESCING_CHILDREN indicator is set to the number of 
children nodes to which the QUIESCE_SPECIAL message is 
25 forwarded 920. If the QUIESCING_CHILDREN indicator equals 
zero, then a QUIESCE_SPECIAL_ACK message is returned to the 
node's parent node 930. 

One embodiment for processing the QUIESCE_SPECIAL_ACK 
message is depicted in Fig. 10. Upon receipt of a 
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.QUIESCE_SPECIAL_ACK message at a node 1000, the node 
decrements its QUIESCING_CHILDREN count 1010, and determines 
whether the QUIESCING_CHILDREN count is zero 1020. If "no", 
this processing is done 1025. Otherwise, the node 
5 determines whether it is the root node for the 

reconfiguration 1030. If "no", then a QUIESCE_SPECIAL_ACK 
message is sent to its parent node 1040. 

If the root comprises the reconfiguration root, or if 
the node's QUIESCING_CHILDREN count equals zero (step 870 of 

10 Fig. 8) , processing sends a QUIESCE_COMPLETE message to the 
node's children in the old configuration 1050. In addition, 
a CONFIGURE message is sent to the children in the new 
configuration 1060, and the RECONFIGURE_ROOT and QUIESCING 
indicators are reset 1070. The node then processes any 

15 messages in the QUIESCE _MSG_QUEUE 1080. 

Fig. 11 depicts one embodiment for processing a 
QUIESCE_COMPLETE message in accordance with the principles 
of the present invention. Upon a node's receipt of a 
QUIESCE_COMPLETE message 1100, the node resets the QUIESCING 

20 indicator 1110. The QUIESCE_COMPLETE message is then 

forwarded to its children nodes with respect to the current 
spanning tree 1120. The node inquires whether the 
CONFIGURE_RECEIVED indicator is set 1130 and, if not, 
processing is completed 1140. Otherwise, the 

25 reconfiguration is complete and the node begins forwarding 
any messages from the QUIESCE_MSG_QUEUE using the new 
spanning tree 1150. 

Fig. 12 depicts one embodiment for processing published 
messages in accordance with the present invention. Upon 
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receipt of a published message 1200, a node determines 
whether the QUIESCING indicator is set 1210. If "no", then 
the message is processed normally 1220. If in quiescing 
state, and the message is received from a local client or 
5 the current node is the reconfiguration root node and the 

message is received from a parent node, the message is added 
to the node's QUIESCE_MSG_QUEUE 1250. Otherwise, the 
message is processed normally 1240. 

Those skilled in the art will note from the description 
10 provided herein that if configuration changes are confined 
to a sub-tree of the routing network, it is possible to 
confine the above-described processing to that sub-tree. In 
such a case, the root is the parent node of the sub-tree 
under which all configuration changes occur. If the sub- 
j{j 15 tree does not contain a logger/sequencer, it is not 
j;* necessary to quiesce the logger/sequencers. In such a case, 

U the control message quiescence phase may be skipped. If the 

\^ sub-tree does contain logger /sequencer node(s), the 

y logger/sequencer nodes within the sub-tree must be quiesced. 

jfj 20 So, the control message quiescence phase would be performed. 
\Q Unlike the above description, data messages may be sent to 

the root by the parent of the root after data messages have 
been quiesced in the sub-tree. In this case, these data 
messages should be queued at either the root of the sub- 
25 tree, or at the logger /sequencer nodes (this would only be 
possible if a logger/sequencer maintained the same function 
in the new configuration) . Logging acknowledgment or 
sequencing acknowledgment messages must not be sent in 
response to these data messages until after the network has 
30 been reconfigured. 
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Also, those skilled in the art will note from the 
description provided herein that if there are multiple 
spanning trees within the routing network and if 
configuration changes are required to more than one spanning 
5 tree, it is possible to extend the above-described 
processing to effect these changes simultaneously. 

The present invention can be included, for example, in 
an article of manufacture (e.g., one or more computer 
program products) having, for instance, computer usable 
10 media. This media has embodied therein, for instance, 
computer readable program code means for providing and 
facilitating the capabilities of the present invention. The 
s '% articles of manufacture can be included as part of the 

Hj computer system or sold separately. 



15 Additionally, at least one program storage device 

readable by machine, tangibly embodying at least one program 
^ of instructions executable by the machine, to perform the 

capabilities of the present invention, can be provided. 



The flow diagrams depicted herein are provided by way 
20 of example. There may be variations to these diagrams or 

the steps (or operations) described herein without departing 
from the spirit of the invention. For instance, in certain 
cases, the steps may be performed in differing order, or 
steps may be added, deleted or modified. All of these 
25 variations are considered to comprise part of the present 
invention as recited in the appended claims. 

While the invention has been described in detail herein 
in accordance with certain preferred embodiments thereof, 
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many modifications and changes therein may be effected by 
those skilled in the art. Accordingly, it is intended by 
the appended claims to cover all such modifications and 
changes as fall within the true spirit and scope of the 
5 invention. 
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